Towards open-set text recognition via label-to-prototype learning
نویسندگان
چکیده
• Formulating a new open-set task requires spotting and cognizing novel characters. Proposing framework that handles characters without retraining. fast rectification technique for text recognition. Scene recognition is popular research topic which also extensively utilized in the industry. Although many methods have achieved satisfactory performance close-set challenges, these lose feasibility scenarios, where collecting data or retraining models could yield high cost. For example, annotating samples foreign languages can be expensive, whereas model each time when “novel” character discovered from historical documents costs both resources. In this paper, we introduce formulate demands capability to spot recognize A label-to-prototype learning proposed as baseline task. Specifically, introduces generalizable mapping function build prototypes (class centers) seen unseen classes. An predictor then reject according prototypes. The implementation of rejection over out-of-set allows automatic unknown incoming stream. Extensive experiments show our method achieves promising on variety zero-shot, close-set, datasets.
منابع مشابه
Named Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملLabel embedding for text recognition
The standard approach to recognizing text in images consists in first classifying local image regions into candidate characters and then combining them with high-level word models such as conditional random fields (CRF). This paper explores a new paradigm that departs from this bottom-up view. In our approach, every label from a lexicon is embedded to an Euclidean vector space. We refer to this...
متن کاملTowards Open-Text Semantic Parsing via Multi-Task Learning of Structured Embeddings
Open-text (or open-domain) semantic parsers are designed to interpret any statement in natural language by inferring a corresponding meaning representation (MR). Unfortunately, large scale systems cannot be easily machine-learned due to lack of directly supervised data. We propose here a method that learns to assign MRs to a wide range of text (using a dictionary of more than 70,000 words, whic...
متن کاملRODRIGUEZ-SERRANO, PERRONNIN: LABEL EMBEDDING FOR TEXT RECOGNITION 1 Label embedding for text recognition
The standard approach to recognizing text in images consists in first classifying local image regions into candidate characters and then combining them with high-level word models such as conditional random fields (CRF). This paper explores a new paradigm that departs from this bottom-up view. We propose to embed word labels and word images into a common Euclidean space. Given a word image to b...
متن کاملLearning a Neural-network-based Representation for Open Set Recognition
Open set recognition problems exist in many domains. For example in security, new malware classes emerge regularly; therefore malware classication systems need to identify instances from unknown classes in addition to discriminating between known classes. In this paper we present a neural network based representation for addressing the open set recognition problem. In this representation insta...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition
سال: 2023
ISSN: ['1873-5142', '0031-3203']
DOI: https://doi.org/10.1016/j.patcog.2022.109109